A Shape-invariant Phase Vocoder for Speech Transformation
Author
Abstract
This paper proposes a new method for shape-invariant real-time modification of speech signals. The method can be understood as a frequency-domain SOLA algorithm that uses the phase vocoder algorithm for phase synchronization. Compared to time-domain SOLA, the new implementation provides improved time synchronization during overlap-add and improved quality of the noise components of the transformed speech signals. The algorithm has been compared in two perceptual tests with recent implementations of the PSOLA and HNM algorithms, demonstrating very satisfactory performance. Because the quality of the transformed signals stays constant over a wide range of transformation parameters, the algorithm is well suited for real-time gender and age transformations.
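For context, the time-domain SOLA baseline that the abstract compares against can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the paper's frequency-domain algorithm: the function name `sola_stretch` and the parameters `frame`, `overlap`, and `search` are hypothetical and chosen only for this example. Each new frame is shifted to the position where its overlap region correlates best with the tail of the output, then cross-faded in.

```python
import numpy as np

def sola_stretch(x, rate, frame=1024, overlap=256, search=128):
    """Time-domain SOLA time stretch (illustrative baseline only).

    rate > 1 shortens the signal, rate < 1 lengthens it.
    All parameter names and defaults are assumptions for this sketch.
    """
    x = np.asarray(x, dtype=float)
    hop_out = frame - overlap              # samples added to the output per frame
    hop_in = int(round(hop_out * rate))    # nominal advance in the input per frame
    out = x[:frame].copy()
    fade = np.linspace(0.0, 1.0, overlap)  # linear cross-fade ramp
    pos = hop_in
    while pos + search + frame <= len(x):
        tail = out[-overlap:]
        # Search for the shift whose overlap region best matches the
        # output tail (normalized cross-correlation).
        best_k, best_score = 0, -np.inf
        for k in range(search + 1):
            seg = x[pos + k : pos + k + overlap]
            denom = np.sqrt(np.dot(seg, seg) * np.dot(tail, tail)) + 1e-12
            score = np.dot(seg, tail) / denom
            if score > best_score:
                best_k, best_score = k, score
        start = pos + best_k
        seg = x[start : start + frame]
        # Cross-fade the synchronized frame into the output tail,
        # then append the remainder of the frame.
        out[-overlap:] = (1.0 - fade) * tail + fade * seg[:overlap]
        out = np.concatenate([out, seg[overlap:]])
        pos += hop_in
    return out
```

Calling `sola_stretch(x, rate=0.5)` on a voiced-like periodic signal roughly doubles its duration while keeping the waveform periods aligned at the frame joins. The search range should cover at least one pitch period for the alignment to succeed, which is one of the synchronization limits the abstract's frequency-domain approach is said to improve on.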
Similar resources
Shape-invariant speech transformation with the phase vocoder
This paper proposes a new phase-vocoder-based method for shape-invariant real-time modification of speech signals. The performance of the method with respect to voiced and unvoiced signal components, as well as some control strategies for the voiced/unvoiced balance of the transformed speech signals, will be discussed. The algorithm has been compared in perceptual tests with an implementation of the...
Real Time Pitch Shifting with Formant Structure Preservation Using the Phase Vocoder
Pitch shifting in speech is presented based on the use of the phase vocoder in combination with spectral whitening and envelope reconstruction, applied respectively before and after the transformation. A band preservation technique is introduced to contain quality degradation when downscaling the pitch. The transposition ratio is fixed in advance by selecting analysis and synthesis window sizes...
Speech to chant transformation with the phase vocoder
The technique used for this composition is a semi-automatic system for speech to chant conversion. The transformation is performed using an implementation of shape-invariant signal modifications in the phase vocoder and a recent technique for envelope estimation that is denoted as True Envelope estimation. We first describe the compositional idea and give an overview of the preprocessing steps t...
Suppression of phasiness for time-scale modifications of speech signals based on a shape invariance property
Time-scale modifications of speech signals, based on frequency-domain techniques, are hampered by two important artifacts which are “phasiness” and “transient smearing”. They correspond to the destruction of the shape of the original signal, i.e. the de-synchronization between the phases of frequency components. This paper describes an algorithm that preserves the shape invariance of speech sig...
Using FFI Interpolator and VQ Quantization for Designing of High Quality 1200 BPS Speech Vocoder
Storing or transmitting speech signals at very low bit rates is a hot area in the field of speech processing. We used stochastic inter-frame interpolators and vector quantization (VQ) as a new method for developing a high-quality 1200 BPS speech vocoder. The objective and subjective test results show that the performance of the new vocoder is comparable with 4800 BPS standard vocoders (such as CELP).
Publication date: 2010